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Abstract 

A multivariate polynomial p(x) — p(x\, . . . ,x n ) is sos-convex if its Hessian H(x) can be 
factored as H{x) — M T (x)M(x) with a possibly nonsquare polynomial matrix M(x). It is 
easy to see that sos-convexity is a sufficient condition for convexity of p(x). Moreover, the 
problem of deciding sos-convexity of a polynomial can be cast as the feasibility of a semidefinite 
program, which can be solved efficiently. Motivated by this computational tractability, it has 
been recently speculated whether sos-convexity is also a necessary condition for convexity of 
polynomials. In this paper, we give a negative answer to this question by presenting an explicit 
example of a trivariate homogeneous polynomial of degree eight that is convex but not sos- 
convex. Interestingly, our example is found with software using sum of squares programming 
techniques and the duality theory of semidefinite optimization. As a byproduct of our numerical 
procedure, we obtain a simple method for searching over a restricted family of nonnegative 
polynomials that are not sums of squares. 

1 Introduction 

In many problems in applied and computational mathematics, we would like to decide whether a 
multivariate polynomial is convex or to parameterize a family of convex polynomials. Perhaps the 
most obvious instance appears in optimization. It is well known that in the absence of convexity, 
global minimization of polynomials is generally NP-hard [23], [21], [17]. However, if we somehow 
know a priori that the polynomial is convex, nonexistence of local minima is guaranteed, and simple 
gradient descent methods can find a global minimum. In many other practical settings, we might 
want to parameterize a family of convex polynomials that have certain properties, e.g., that serve 
as a convex envelope for a non-convex function, approximate a more complicated function, or fit 
some data points with minimum error. To address many questions of this type, we need to have 
an understanding of the algebraic structure of the set of convex polynomials. 

Over a decade ago, Pardalos and Vavasis [19] put the following question proposed by Shor on the 
list of seven most important open problems in complexity theory for numerical optimization: "Given 
a degree-4 polynomial in n variables, what is the complexity of determining whether this polynomial 
describes a convex function?" To the best of our knowledge, the question remains open but the 
general belief is that the problem should be hard (see the related work in [8] ) . Not surprisingly, if 
testing membership to the set of convex polynomials is hard, searching and optimizing over them 
also turns out to be a hard problem. 

The notion of sos-convexity has recently been proposed as a tractable relaxation for convexity 
based on semidefinite programming. Broadly speaking, the requirement of positive semidefiniteness 
of the Hessian matrix is replaced with the existence of an appropriately defined sum of squares 
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decomposition. As we will briefly review in this paper, by drawing some appealing connections 
between real algebra and numerical optimization, the latter problem can be reduced to the feasibility 
of a semidefinite program. Besides its computational implications, sos-convexity is an appealing 
concept since it bridges the geometric and algebraic aspects of convexity. Indeed, while the usual 
definition of convexity is concerned only with the geometry of the epigraph, in sos-convexity this 
geometric property (or the nonnegativity of the Hessian) must be certified through a "simple" 
algebraic identity, namely the sum of squares factorization of the Hessian. 

Despite the relative recency of the concept of sos-convexity, it has already appeared in a number 
of theoretical and practical settings. In [9], Helton and Nie use sos-convexity to give sufficient con- 
ditions for semidefinite representability of semialgebraic sets. In [13], Lasserre uses sos-convexity to 
extend Jensen's inequality in convex analysis to linear functionals that are not necessarily probabil- 
ity measures, and to give sufficient conditions for a polynomial to belong to the quadratic module 
generated by a set of polynomials [12]. More on the practical side, Magnani, Lall, and Boyd |15] 
have used sum of squares programming to find sos-convex polynomials that best fit a set of data 
points or to find minimum volume convex sets, given by sub-level sets of sos-convex polynomials, 
that contain a set of points in space. 

Even though it is well-known that sum of squares and nonnegativity are not equivalent, because 
of the special structure of the Hessian matrix, sos-convexity and convexity could potentially turn 
out to be equivalent. This speculation has been bolstered by the fact that finding a counterexample 
has shown to be difficult and attempts at giving a non-constructive proof of its existence have seen 
no success either. Our contribution in this paper is to give the first such counterexample, i.e., the 
first explicit example of a polynomial that is convex but not sos-convex. This example is presented 
in Theorem 13.21 Our result further supports the hypothesis that deciding convexity of polynomials 
should be a difficult problem. We hope that our counterexample, in a similar way to what other 
celebrated counterexamples |16[ [29l [321 H] have achieved, will help stimulate further research and 
clarify the relationships between the geometric and algebraic aspects of positivity and convexity. 

The organization of the paper is as follows. Section [2] is devoted to mathematical preliminaries 
required for understanding the remainder of this paper. We begin this section by introducing the 
cone of nonnegative and sum of squares polynomials. We briefly discuss the connection between 
sum of squares decomposition and semidefinite programming highlighting also the dual problem. 
Formal definitions of sos-convex polynomials and sos-matrices are also given in this section. In 
Section [31 we present our main result, which is an explicit example of a convex polynomial that is 
not sos-convex. Finally, we explain in Section H] how we have utilized sos-programming together 
with semidefinite programming duality theory to find the example presented in Section [3l We 
comment on how one can use the same methodology to search for a restricted class of nonnegative 
polynomials that are not sos. 

2 Background 

2.1 Nonnegativity and sum of squares 

We denote by K[x] := . . . ,x n ] the ring of polynomials in n variables with coefficients in the 

field K. Throughout the paper, we will have IK = R or 1C = Q. A polynomial p{x) € M[x] is said 
to be nonnegative or positive semidefinite (psd) if p(x) > for all x E W 1 . Clearly, a necessary 
condition for a polynomial to be psd is for its total degree to be even. We say that p{x) is a sum 
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of squares (sos), if there exist polynomials qi(x), ...,q m (x) such that 



in 



p( x ) = ^2<ii(x)- 



(i) 



i=i 



It is clear that p(x) being sos implies that p(x) is psd. In 1888, David Hilbert [10] proved that the 
converse is true for a polynomial in n variables and of degree d only in the following cases: 

• n = 1 (univariate polynomials of any degree) 

• d = 2 (quadratic polynomials in any number of variables) 

• n = 2, d = 4 (bivariate quartics) 

Hilbert showed that in all other cases there exist polynomials that are psd but not sos. Explicit 
examples of such polynomials appeared nearly 80 years later, starting with the celebrated example 
of Motzkin, followed by more examples by Robinson, Choi and Lam, and Lax-Lax and Schmudgen. 
See [28] for an outstanding exposition of these counterexamples. 

A polynomial p(x) of degree d in n variables has I = ( n ~j i d ) coefficients and can therefore be 
identified with the /-tuple of its coefficients, which we denote by p £ K*. A polynomial where all 
the monomials have the same degree is called a form. A form p{x) of degree d is a homogenous 



forms in n variables of degree d is denoted by Ti nt d- It is easy to show that if a form of degree d is 
sos, then d is even, and the polynomials qi in the sos decomposition are forms of degree d/2. We 
also denote the set of psd (resp. sos) forms of degree d in n variables by P n ,d (resp. S n ^). Both 
Pn t d and S ni d are closed convex cones [28J, and we have the relation E n ^ C P n d c TL n ,d- 

Any form of degree d in n variables can be dehomogenized into a polynomial of degree < d in 
n — 1 variables by setting x n = 1. Conversely, any polynomial p of degree d in n variables can be 
homogenized into a form p n of degree d in n + 1 variables, by adding a new variable y, and letting 



The properties of being psd and sos are preserved under homogenization and dehomogenization. 

An important related problem is Hilbert's 17th problem, which asks if every psd form must 
be a sum of squares of rational functions. In 1927, Artin [1] answered Hilbert's question in the 
affirmative. This result implies that if a polynomial p(x) is psd, then there must exist an sos 
polynomial g(x), such that p{x)g{x) is sos. Moreover, Reznick showed in |27] that if p{x) is positive 
definite, one can always take g(x) = (^ xf ) r , for sufficiently large r. We will make use of this key 
fact in the derivation of our example. 

To make the ideas presented so far more concrete, we end this section by discussing the example 
of Motzkin. Interestingly enough, this example will reappear in Section H] where it serves as a 
starting point in the numerical procedure that leads to our example. 

Example 2.1. The Motzkin polynomial 



is historically the first known example of a polynomial that is psd but not sos. Positive semidefinite- 
ness follows from the arithmetic-geometric inequality, and the nonexistence of an sos decomposition 



function of degree d (since 




coefficients. The set of 





(2) 
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can be shown by some clever algebraic manipulations (see [28J for details). We can homogenize this 
polynomial and obtain the Motzkin form 

Mh(x\,X2, X3) := x\x\ + x\x\ — ?>x\x\x\ + x\, (3) 

which belongs to P^^ \ as expected. An alternative proof of nonnegativity of (resp. M) 
is obtained by showing that Mh(x)(x\ + x\ + x\) (resp. M{x){x\ + x\ + 1)) is sos. An explicit sos 
decomposition can be found in |20j . 

In the sequel, we will explain how we can give an alternative proof of the fact that the Motzkin 
polynomial is not sos, by appealing to sos-programming duality. 

2.2 Sum of squares, semidefinite programming, and duality 

Deciding nonnegativity of polynomials is an important problem that arises in many areas of applied 
and computational mathematics. Unfortunately, this problem is known to be NP-hard even when 
the degree of the polynomial is equal to four [21], [T7]. On the other hand, deciding whether a 
given polynomial admits an sos decomposition turns out to be a tractable problem. This tractability 
stems from the underlying convexity of the problem as first pointed out in [33], [E], [20]. More 
specifically, it was shown in [20] that one can reduce the problem of deciding whether a polynomial is 
sos to feasibility of a semidefinite "program (SDP). Semidefinite programs are a well-studied subclass 
of convex optimization problems that can be efficiently solved in polynomial time using interior 
point algorithms. Because our space is limited, we refrain from further discussing SDPs and refer 
the interested reader to the review papers [361 135j . The main theorem that establishes the link 
between sum of squares and semidefinite programming is the following. 

Theorem 2.1 (|20j,[21j). A multivariate polynomial p{x) in n variables and of degree 2d is a 
sum of squares if and only if there exists a positive semidefinite matrix Q ( often called the Gram 
matrix) such that 

p(x) = z T Qz, (4) 
where z is the vector of monomials of degree up to d 

Z = [1, XI, X 2 , ■ ■ ■ , X n , X X X 2 , . . . , X*]. 

Given a polynomial p(x), by expanding the right hand side of (j4|) and matching coefficients 
of p, we obtain linear constraints on the entries of Q. We also have the constraint that Q must 
be a positive semidefinite 

(PSE0) 

matrix. Therefore, the set described by these constraints is the 
intersection of an affine subspace with the cone of PSD matrices. This is exactly the structure of 
the feasible set of a semidefinite program [36]. Since the entries of the vector of monomials z can 
be algebraically dependent, the matrix Q in the representation (jlj) is not in general unique. The 
size of the matrix Q depends on the size of the vector of monomials. When there is no sparsity 
to be exploited Q will have dimensions X • If the polynomial p(x) is homogeneous of 

degree 2d, then it suffices to consider in (j3|) a vector z of monomials of degree exactly d. This will 
reduce the size of Q to x 

The conversion step of going from an sos decomposition problem to an SDP problem is fully 
algorithmic and has been implemented in software packages such as SOSTOOLS [26] and YALMIP 
[14] . For instance, we can input a polynomial p(x) into SOSTOOLS and (if p(x) is sos) it will 

lr To avoid potential confusion, we use the abbreviation psd for positive semidefinite polynomials and PSD for 
positive semidefinite matrices. We also denote a PSD matrix A with the standard notation A y 0. 
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return a matrix Q and a vector of monomials z. Since Q is PSD, one can compute its Cholesky 
factorization Q = V T V, which immediately gives rise to an explicit sos decomposition 

p(x) = 

i 

Solutions returned from interior point algorithms are numerical approximations computed via 
floating point manipulations. In many applications in mathematics where the goal is to formally 
prove a theorem (as is the case in this paper), it is required to get an exact algebraic solution. 
What we mean by this is that given a polynomial p(x) € Q[x], i.e., a polynomial with rational 
coefficients, we would like to compute a rational sos decomposition, i.e., a decomposition only 
consisting of squares of polynomials in Q[x], This issue has been studied in detail in [25] where 
it is shown that the existence of a rational sos decomposition is equivalent to the existence of a 
Gram matrix with rational entries. SOSTOOLS is endowed with a feature that computes rational 
decompositions. The work in [25] proposes an efficient mixed symbolic-numerical approach for this 
purpose and has been separately implemented in the package SOS.m2 [24] for the computer algebra 
system Macaulay 2 [fj. 

Putting the issue of exact computation aside, there are two other key aspects of sum of squares 
programming that we would like to highlight. First, it is not difficult to see that the same method- 
ology can be used to search over sos polynomials in a convex family of polynomials or even optimize 
a linear functional over them [20] . This idea will be crucial in Section when we are searching for 
our desired polynomial through sos-programming. 

The second valuable feature of sos-programming is that when the semidefinite program arising 
from Theorem 12. II is infeasible, we get a certificate that the polynomial is not sos (though it might 
still be psd). This certificate is readily given to us by a feasible solution of the dual semidefinite 
program. By definition, the dual cone S* . of the sum of squares cone S n ^ is the set of all linear 
functionals that take nonnegative values on it, i.e., 

K 4 ■= e n* n4 , (v, P ) > o v P g E n , rf }. 

Here, the dual space H* n d denotes the space of all linear functionals on H n> d, and (., .) represents 
the pairing between elements of the primal and the dual space. If a polynomial is not sos, we can 
find a dual functional fj, £ E* d that separates it from the closed convex cone The basic idea 

behind this is the well known separating hyperplane theorem in convex analysis; see e.g. [U [30]. In 
Section El we will see a concrete example of the use of duality when we prove that our polynomial 
is not sos-convex. For a more thorough treatment of the duality theory in semidefinite and sum of 
squares programming, we refer to reader to references [36] and [21] . respectively. 

2.3 Sum of squares matrices and sos-convexity 

The notions of positive semidefiniteness and sum of squares of scalar polynomials can be naturally 
extended to polynomial matrices, i.e., matrices with entries in R[x]. We say that a symmetric 
polynomial matrix P(x) <G M[x] mxm is PSD if P(x) is PSD for all x £ R n . It is straightforward to 
see that this condition holds if and only if the polynomial y T H(x)y inm + n variables [x; y] is psd. 
The definition of an sos-matrix is as follows [TT] , [6] , [31] . 

Definition 2.2. A symmetric polynomial matrix P(x) € M[x] mxm , x 6 M n is an sos-matrix if 
there exists a polynomial matrix M(x) € M[x] sxm for some s £ N, such that P{x) = M T (x)M(x). 

Lemma 2.3. A polynomial matrix P{x) G M[x] mxrn , x £ M n is an sos-matrix if and only if the 
scalar polynomial y T P(x)y is a sum of squares in M[x;y] . 
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Proof. One direction is trivial: if P(x) admits the factorization M T (x)M(x), then the scalar poly- 
nomial y T M T (x)M(x)y = (M (x)y) T (M (x)y) is clearly sos. For the reverse direction see □ 



Lemma 12.31 enables us to easily check whether a given polynomial matrix is an sos-matrix 
with the machinery explained in Section [2.21 Remarkably, in the univariate case {x € R), any PSD 



polynomial matrix P(x) E 



is an sos-matrix; see e.g. [5]. For more details about univariate 



polynomial matrices, references to the literature, as well as an efficient eigenvalue-based method 
for finding their sos decomposition, we refer the reader to [2]. 

In the multivariate case, however, not every PSD polynomial matrix must be an sos-matrix. 
The first counterexample is due to Choi [I]. Even though Choi did not have polynomial matrices 
in mind, in [4j he showed that not every psd biquadratic form is a sum of squares of bilinear forms. 
His counterexample can be rewritten as the following polynomial matrix 



C{x) 



x\ + 2x\ —X\X2 



-X\X2 x\ + 2x\ 



-XIX 3 

2 1 0^2 



-x^xj, X3 + 2x( 



(5) 



which is PSD for all x € M 3 but is not an sos-matrix. 

We will now specialize polynomial matrices to Hessians, and discuss convexity of polynomials. 
It is well known that a polynomial p(x) ■= p(xi, ■ ■ ■ , x n ) is convex if and only if its Hessian 



H(x) 



a 2 P 

r).rf 
8x2 dxi 



d 2 p 



d 2 p 
dxi 8x2 

8 2 p 
dxi 



8 2 p 



dp 
dxi dx n 

d 2 p 
8x2 dx„ 



d 2 p 



(6) 



. dx n dxi dx n 8x2 
is PSD for all x 6 M n , i.e., is a PSD polynomial matrix. 

Definition 2.4 ([9]). A polynomial p(x) is sos-convex if its Hessian H(x) is an sos-matrix. 

Even though we know that not every PSD polynomial matrix is an sos-matrix, it has been 
speculated that because of the special structure of the Hessian as the matrix of the second deriva- 
tives, convexity and sos-convexity of polynomials could perhaps be equivalent. We will show in the 
next section that this is not the case. Note that the example of Choi in ([5]) does not serve as a 
counterexample. The polynomial matrix C{x) in ([5j) is not a valid Hessian, i.e., it cannot be the 
matrix of the second derivatives of any polynomial. If this was the case, the third partial derivatives 
would commute. However, we have in particular 



dC hl (x) 
dx 3 



0^-x 3 



dx\ 



In [5], Choi, Lam, and Reznick generalize the earlier results of Choi [1] and provide more 
examples of psd multiforms that are not sos. Some of their examples can be rewritten as PSD 
polynomial matrices that are not sos-matrices. In a similar fashion, we can show that these matrices 
also fail to be valid Hessians. 
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3 A polynomial that is convex but not sos-convex 



We start this section with a lemma that will appear in the proof of our main result. 

Lemma 3.1. If P(x) E M[x]" ixm is an sos-matrix, then all its 2 m — 1 principal minor^ are sos 
polynomials. In particular, det(P) and the diagonal elements of P must be sos polynomials. 

Proof. We first prove that det(P) is sos. By Definition 12.21 we have P(x) = M T (x)M(x) for some 
s x m polynomial matrix M(x). If s = m, we have 

det(P) = det(M T ) det(M) = (det(M)) 2 

and the result is immediate. If s > m, the result follows from the Cauchy-Binet formula!!. We have 

det(P) = £ s det(M T ) s det(M s ) 

= J2 s det(M s ) T det(M s ) 

= £ S (det(M s )) 2 . 

Finally, when s < m, det(-P) is zero which is trivially sos. In fact, the Cauchy-Binet formula also 
holds for s = m and s < m, but we have separated these cases for clarity of presentation. 

Next, we need to prove that the smaller minors of P are also sos. Define Ai = {1, . . . , m}, and 
let I and J be nonempty subsets of M. Denote by Pjj a sub-block of P with row indices from I 
and column indices from J. It is easy to see that 

Pjj = (M T ) JM M MJ = (M MJ ) T M MJ . 

Therefore, Pjj is an sos-matrix itself. By the proceeding argument det(Pjj) must be sos, and 
hence all the principal minors are sos. □ 

Remark 3.1. The converse of Lemma 13.11 does not hold. The Choi matrix ([5]) serves as a coun- 
terexample. It is easy to check that all 7 principal minors of C(x) are sos polynomials and yet it 
is not an sos-matrix. This is in contrast with the fact that a polynomial matrix is PSD if and only 
if all its principal minors are psd polynomials. The latter statement follows almost immediately 
from the well-known fact that a constant matrix is PSD if and only if all its principal minors are 
nonnegative. 

We are now ready to state our main result. 

Theorem 3.2. There exists a polynomial that is convex but not sos-convex. In particular, the 
trivariate form of degree 8 given by 

p(x) = 32xf + 118a;? x\ + AOxfxl + 25xjx% - AZx\x\x\ - 35x|x| + ?>x\x\x\ 

" " (7) 

-\§x\x\x\ + 2kx\x% + 16x| + ^x\x\ + 70x|x| + 60x|x| + 30x| 

has these properties. 

2 We remind the reader that the principal minors of an m x m matrix A are the determinants of all k x k (1 < k < m) 
sub-blocks whose rows and columns come from the same index set S C {1, . . . , m}. 

3 Given matrices A and B of size m x s and s x m respectively, the Cauchy-Binet formula states that 

det(AB) = ^det(yl s )det(Bs), 

s 

where S is a subset of {1, . . . , s} with m elements, As denotes the m x m matrix whose columns are the columns of 
A with index from S, and similarly Bs denotes the m x m matrix whose rows are the rows of B with index from S. 
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Proof. Let H(x) denote the Hessian of p{x). Convexity follows from the fact that 

{x\ + x 2 2 + xl)H{x) = M T (x)M(x), (8) 

for some polynomial matrix M(x). Equivalently, 

(xl + xl + xl)y T H{x)y (9) 

is a sum of squares in M.[x; y], which shows that H(x) is a PSD polynomial matrix. In the Appendix, 
we provide an explicit sos representation in terms of rational Gram matrices for the polynomial 
([9]). This representation was found using SOSTOOLS along with the SDP solver SeDuMi [34] . 
To prove that p(x) is not sos-convex, by Lemma 13. II it suffices to show that 



H 1A (x) = |jj = 1792a?f + SMOxjxl + 1200xfx| + 300x 2 x% 

- h\§x\x\x\ - 420xlxj + §x\x\ - 32x\x\ + 4&cjj 



(10) 



is not sos (though it must be psd because of convexity). Define the subspace S C 7^3,6 as 

c. r6 42 42 24 222 24 42 24 6l 

' nri o n J T* T* ry" ry -*- ry" ry ry ry" f ry ry" <"y* /"y -*- ry" ry ry *■ ry " I (III 

o •— bpdii^x^, x 1 x 2 , ^1^3) •t'i't'2' X 1 X 2 X 3' x l x 3; X 2 X 3' X 2 X 3' X 3J ) V 11 / 

which are the trivariate sextic forms containing only the monomials in (jlip . Note that belongs 
to S. We will prove that H\ } \ is not sos by presenting a dual functional £ that separates iif^i from 

s 3 , 6 n5. 

Consider the vector of coefficientqj of H\ i with the ordering as written in (jlOf) : 



Hf A = [1792, 3540, 1200, 300, -516, -420, 6, -32, 48]. 
Using the same ordering, we can represent our dual functional £ with the vector 

c T = [0.039, 0.051, 0.155, 0.839, 0.990, 1.451, 35.488, 20.014, 17.723], 
which will serve as a separating hyperplane. We have 

(f , Hi ti ) = c T H ltl = -8.948 < 0. (12) 
On the other hand, we claim that for any form w S S 3 6 n 5, we should have 

(f, w ) = c T w > 0. (13) 
Indeed, if w is sos, by Theorem 12.11 it can be written in the form 

w(x) = z T Qz = Tr Q ■ zz 1 ' , 
for some Q >z 0, and a vector of monomials 

Z T = [x\,X 1 xl,X 1 xl,x\x2,X2xl,X 1 X2X2,,X2,x\,XzX 2 \,x\] 

that includes all monomials of degree 3 except for x 2 , which is not required. It is not difficult to 
see that 

c T w = TtQ ■ (zz T )\ c , (14) 



4 As a trivariate form of degree 6, should have 28 coefficients. We refrain from showing the coefficients that 
are zero since our analysis is done in the lower dimensional subspace S. 



where by (zz T )\ c we mean a matrix where each monomial in zz T is replaced with the corresponding 
element of the vector c (or zero, if the monomial is not in S). This yields the matrix 
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and we can easily check that it is positive definite. Therefore, equation (|14j) along with the fact 
that Q is positive semidefinite implies that (|13p holds. This completes the proof. □ 

We end this section with a few remarks on some of the properties of the polynomial p(x) in (J7|). 

Remark 3.2. The Gram matrices in the sos decomposition of © (presented in the Appendix) are 
positive definite. This shows that for all nonzero x, the Hessian H{x) is positive definite and hence 
p(x) is in fact strictly convex; i.e., 

p(Xx + (1 - X)y) < Xp{x) + (1 - X)p{y) Vx, y e M. 3 and A <E (0, 1). 

Remark 3.3. Because of strict convexity and the fact that H\i is strictly separated from E3 g (see 
(|12jl ). it follows that p(x) is in the interior of the set of trivariate forms of degree 8 that are convex 
but not sos-convex. In other words, there exists a neighborhood of polynomials around p(x), such 
that every polynomial in this neighborhood is also convex but not sos-convex. 

Remark 3.4. As explained in Section [2.11 we can dehomogenize the form in (J7|) into a polynomial 
in two variables by letting 

Pdh(xi,x 2 ) := p(x 1 ,x 2 ,l)- (15) 

The bivariate polynomial p^h has degree 8 and we can check that it is still convex but not sos- 
convex. It is interesting to note that p^h is an example with the minimum possible number of 
variables since we know that all convex univariate polynomials are sos-convex. As for minimality in 
the degree, we do not know if an example with lower degree exists. However, we should note that 
a bivariate form of degree 4 cannot be convex but not sos-convex. The reason is that the entries 
of the Hessian of such polynomial would be bivariate quadratic forms. It is known that a matrix 
with such entries is PSD if and only if it is an sos-matrix [3]. 

Remark 3.5. Unlike nonnegativity and sum of squares, sos-convexity may not be preserved under 
homogenization. To give a concrete example, one can check that Pdh( x 2, x 3) := p(l, x 2 , x%) is 
sos-convex, i.e., the 2x2 Hessian of pdh( x 2, ^3) is an sos-matrix. 

Remark 3.6. It is easy to argue that the polynomial p in ^ must itself be nonnegative. Since 
p is strictly convex, it has a unique global minimum. Clearly, the gradient of p has no constant 
terms and hence vanishes at the origin. Therefore, x = must be the unique global minimum of p. 
Because we have p(0) = 0, it follows that p is in fact positive definite. 

Remark 3.7. In [9], Helton and Nie prove that if a nonnegative polynomial is sos-convex, then it 
must be sos. Since p is not sos-convex, we cannot directly use their result to claim that p is sos. 
However, we have independently checked that this is the case simply by getting an explicit sos 
decomposition of p using SOSTOOLS. 



9 



4 Our procedure for finding the example 



As we mentioned in Section \2.2\ one of the main strengths of sos-programming is in its ability to 
search over sos polynomials in a convex family of polynomials. Our main example in (|7|) has in fact 
been found by solving an sos-program. In this section, we explain how this has been exactly done. 

The task of finding a polynomial p(x) that is convex but not sos-convex is equivalent to finding 
a polynomial matrix H(x) that is a valid Hessian (i.e., it is a matrix of second derivatives), and 
satisfies the following requirement on the scalar polynomial y T H(x)y in [x;y]: 

y T H(x)y must be psd but not sos. (16) 

Indeed, if such a matrix H(x) is found, the desired polynomial p(x) can be recovered from it by 
integration. Unfortunately, a constraint of type (|16|) that requires a polynomial to be psd but 
not sos cannot be easily handled, since it is a non-convex constraint. This is easy to see from 
a geometric viewpoint, since as Theorem 12.11 suggests, an sos-program can be converted to an 
equivalent semidefinite program. We know that the feasible set of a semidefinite program is always 
a convex set. On the other hand, for a fixed degree and dimension, the set of psd polynomials 
that are not sos is generally non-convex. Nevertheless, we are going to see that by making use of 
dual functionals of the sos cone along with Reznick's result on Hilbert's 17th problem [27], we can 
formulate an sos-program that searches over a convex subset of the set of polynomials that are psd 
but not sos. The idea behind our algorithm closely resembles the proof of Theorem 13.21 

The algorithm. The sos-program which has led to our main result in ([7]) can be written in 
pseudo-code as follows. 



SOS-Program 1 



1: Parameterize p(x) as a form of degree 8 in 3 variables. 
2: Compute the Hessian H(x) = ^S- 
3: Impose the constraint 



(xj + x\ + x\) r y T H{x)y sos. (17) 

for some integer r > 1. 
4: Impose the constraint 

(M,ffi,i><0. (18) 



for some (carefully chosen) dual functional /x G S 



3,6- 



The decision variables of this sos-program are the coefficients of the polynomial p{x) that also 
appear in the entries of the Hessian matrix H(x). The scalar r and the dual functional /i must be 
fixed a priori as explained in the sequel. Note that the constraints (fTTj) and (fl"8|) are linear in the 
decision variables and indeed the feasible set described by these constraints is a convex set. 

We claim that if this sos-program is feasible, the solution p(x) will be convex but not sos-convex. 
The requirement of H (x) being a valid Hessian is met by construction since H{x) is obtained by 
twice differentiating a polynomial. It is also easy to see that if the constraint in (|17p is satisfied, 
then y T H(x)y will be psd. The same implication would hold if instead of {x\-\-x^+x\) r we used any 
other positive definite polynomial. As discussed in Section 12.11 the reason for using this particular 
form is due to the result of Reznick [27] . which states that if y T H(x)y is positive definite, then (|17p 
must be satisfied for sufficiently large r. In our case, it was sufficient to take r = 1. 

In order to guarantee that y T H{x)y is not sos, by Lemma 13. II it suffices to require at least one 
of the principal minors of H(x) not to be sos (though they must all be psd because of (fl~7l) and 
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Remark I3.ip . The constraint in (|18p is imposing this requirement on the first diagonal element 
Hi \. Since p{x) is a form of degree 8 in 3 variables, H\ i will be a form of degree 6 in 3 variables. 
The role of the dual functional fi G £| 6 in ([18]) is to separate H\ \ from £3,6- Once an ordering on 
the monomials of H\ \ is fixed, the inequality in (|18p can be written as 

{fi,H ltl ) = b T H 1A <0, (19) 

where 6 G M 28 represents our separating hyperplane and must be fixed a priori. We explain next 
our specific choice of the dual functional fi. 

Finding a separating hyperplane. There are several ways to obtain a separating hyperplane 
for £3^. In particular, we can find a dual functional that separates the Motzkin form in @ from 
£3,6- This can be done in at least a couple of different ways. For example, we can formulate a 
semidefinite program that requires the Motzkin form to be sos. This program is clearly infeasible. 
A feasible solution to its dual semidefinite program will give us the desired separating hyperplane. 
Most SDP solvers, such as SeDuMi, use primal-dual interior point algorithms to solve an SDP. 
Therefore, once the primal SDP is infeasible, a dual feasible solution can readily be recovered from 
the solver. 

Another way to obtain a separating hyperplane for the Motzkin form M/j(x) is to find its 
(Euclidean) projection Mj^(x) onto the cone £3,6- Since the projection is done onto a convex set, 
the hyperplane tangent to £3,6 at M^(x) will be supporting £3,6- The projection M^(x) can be 
obtained by searching for an sos polynomial that is closest in the 2-norm of the coefficients to the 
Motzkin form. This search can be formulated as the following sos-program: 

min \\M h (x) - q{x)\\ 2 ^ 

subject to q(x) sos. 

Here, q{x) is parameterized as a degree 6 form in 3 variables. The objective function in (I20p can 
be converted to a semidefinite constraint using standard tricks; see e.g. |36| . 

We have used SOSTOOLS and SeDuMi to obtain a feasible solution to SOS-Program CD with 
r = 1 and the dual function [i computed using the projection approach described above. In order 
to end up with the form in (|T|) which has integer coefficients, some post-processing has been done 
on this feasible solution. This procedure includes truncation of the coefficients and some linear 
coordinate transformations. 

We shall end our discussion with a couple of remarks. 

Remark 4.1. If the constraints and the objective function of a semidefinite program possess some 
type of symmetry, the same symmetry will generally be inherited in the solution returned by interior 
point algorithms. For example, consider the sos-program in (|20p . The Motzkin form M^x) is 
symmetric in x\ and x 2 ; see ([3]). Therefore, it turns out that the optimal solution M^(x) is also 
symmetric in x\ and x 2 . On the other hand, our main example p{x) in (|7j) and its dehomogenized 
version pdh{ x ) m (|15p are not symmetric in x\ and x 2 . Even though constraint (|17p and the dual 
functional b possess this symmetry, the symmetry is being broken by imposing constraint (|18p on 
the second partial derivative with respect to x\. 

Remark 4.2. Perhaps of independent interest, the methodology explained in this section can be 
employed to search or optimize over a restricted family of psd polynomials that are not sos using 
sos-programming. In particular, we can use this technique to simply find more instances of such 
polynomials. In order to impose a constraint that some polynomial q{x) must belong to P n ^d \ £n,d> 
we can use a dual functional r\ G £* d to separate q{x) from £ nj d, and then require q{x)(^™ =l xf) r 
to be sos, so that q(x) stays in P n ,d- 
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A Rational SOS Decomposition of the Hessian 



In this Appendix we present an explicit SOS decomposition of the Hessian of the form in Theo- 
rem [3T2] Since the form ([9]) belongs to 7^3,10 ~ M? 20 (is a trivariate form of degree 10), we have 
exploited the Newton polytope of the form, as well as its symmetries in order to reduce the size of 
our certificate. For a description of general techniques for exploiting structure and symmetries in 
sum of squares programs, the reader is referred to \22\ l6j. 

For our example, note that only even powers of x±, X2, and x 3 appear in the form p(x) in ([7]). 
Therefore, p(x) is invariant under changing the signs of X\, X2, and x 3 . This induces a similar 
type of symmetry in the form (x\ + x\ + x 3 ) y T H(x)y in Q. Namely, the form is invariant under 
the group Z\, acting via the transformations (xi,yi) 1— > (— Xi, — yi) for i = 1,2,3. This symmetry 
enables us to solve a semidefinite program of considerably smaller size by splitting the monomials 
in isotypic components corresponding to the parity of exponents. More concretely, we can split the 
monomials of degree five into four groups Zi,Z2,z 3 , and Z4 that are associated with the exponent 
patterns (OEE), (EOE), (EEO), and (OOO). Here, E/O means even/odd, and corresponds to the 
parity of the monomials in terms of their combined degree in x and y, i.e., the monomials are 
classified in terms of their degree in {xi,y±}, {2:2,2/2}, and {2:3, 1/3}. For example, the monomial 
x\x\y2 has combined degrees equal to (2,3,0) and therefore belongs to the (E,0,E) group. 

An explicit sos decomposition of the form in ([9]) is: 



[x\ + x\ + xf) y T H(x)y = z\Q\Z\ + z%Q 2 Z2 + zjQ 3 z 3 + zjQ 4 z 4 , 
where the matrices Qi and monomial vectors zi are given by 

z i = Mh/i^i^is/i) xxxly 3 , xix 2 xly 2 ,xixlx 3 y 3 ,xixly2, x\xlyi,x\xlyi,x\x 3 y 3 , x\x 2 y2,x\yi} 
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